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(54) Aligning masks and wafers in the 
manufacture of semiconductor devices 

(57} Method of discriminating stillness 
of a step exposure apparatus is char- 
acterized by the steps of causing a mask 
and wafer to step relative to each other, 
repetitively detecting the relative dis- 
placement (degree of out-of-alignment) 
between the mask and wafer, proces- 
sing to compute a value representing a 
variation in the displacement measure- 
ments, and comparing the computed 
value with a critical value relating to the 
attenuation of vibration resulting from 
the stepping motion to instruct the 
computation of the amount of further 
relative movement to be effected be- 
tween the mask and wafer if the mea- 
surement is smalierthan the critical 
value (and assuming the mask and 
wafer are not yet properly aligned) or to 
instruct the re-computation of the value 
if it is larger than the critical value. 
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SPECIFICATION 

Method and device for discriminating stillness of a 
step exposure apparatus 

5 

Background of the invent/on 
Field of the invent/on 

The present invention relates to a method and 
device for discriminating stillness of a semiconduc- 
10 tor exposure apparatus, particularly a step-and- 
repeat projection alignment and exposure apparatus 
(stepper) using a reduction optical system. 

Description of t fie prior art 

15 Particularly, the present invention prevents the 
computation of the degree of alignment from being 
adversely affected by the relative movement be- 
tween a photo-mask and wafer which is produced by 
an oscillation due to the step-and-repeat motion in a 

20 stage. The present invention further prevents a 
variation in detected signals due to various factors 
such as the reflectivity of the wafer, the surface 
condition of a photo-register and others from provid- 
ing an indifinite time required to attain the desired 

25 alignment or degrading the accuracy of the align- 
ment. 

In general, the semiconductor exposure apparatus 
(mask aligner) is used for exposing a wafer to the 
actual element pattern of a mask which functions as 
30 an original. Such a semiconductor exposure appar- 
atus utilizes either of a whole surface pattern 
exposure process characterized by a single shot, or a 
repetitive (divided) exposure process charactenzed 
by a plurality of shots. The whole surface pattern 
35 exposure process is used in cases where a wafer is 
exposed in contact with a mask (contact method), 
where a wafer is exposed, without using any imag- 
ing system, with the wafer being spaced away from 
a mask by a very small distance (proximity method), 
40 where a wafer is maintained spaced away from a 
mask by a sufficient distance such that they will 
always be disposed in such a relationship that the 
pattern on the mask can be imaged on the wafer 
through an one-to-one imaging system including 

45 lenses or mirrors (one-to-one projection method), 
and so on. The whole surface pattern exposure 
process exposes the wafer to the actual element 
pattern on the mask at a time. If it is desired to form a 
very compactly integrated element, however, the 

50 whole surface pattern involves a difficulty in manu- 
facturing a mask, since it has to have, at the unit 
scale a very fine pattern to which a wafer is to be 
exposed. 

In view of this prooiem, the repetitive exposure 
55 process has been proposed in which a wafer is 
exposec at its effective area to a pattern on a mask 
th'-ough a plurality cf exposure shots by expos ng 
the wafer to the pattern of the mask through a 
reduction projection lens system having its magmf 
60 cation smallerthan one and moving the wafer and 
mask relative to each other such that the exposed 
regions on the wafe- will not be overlapped one on 
another throughout the exposure procedure. In this 
repetitive exposure process utilizing the reduction 
65 projection lens svstem. the pattern on the mask can 



GB 2 131 205 A 



be enlarged by a factor of the inverse number of the 
magnification in the imaging system. Therefore, the 
above difficulty associated with the manufacture of 
the mask can be decreased. 

70 The repetitive exposure process utilizes either of 
an OFF-AXIS alignment system or a TTL ON-AXIS 
alignment system. 

In the off-axis alignment system, the alignment 
pattern on a wafer is first aligned with an alignment 

75 optical system fixed outside of a projection optical 
system. The wafer is then moved accurately to under 
the projection optical system so that the alignment 
pattern on the wafer will be aligned indirectly with 
the alignment pattern on a photo-mask. In the TTL 

80 on-axis system, the alignment patterns on a wafer 
and photo-mask are simultaneously observed 
through a projection optical system such that the 
alignment patterns can directly be aligned with each 
other. 

85 The art of semiconductor elements is being unli- 
mitedly advanced toward a goal at which they are 
integrated comactly as far as possible and operated 
at a speed as high as possible. The mask aligner is 
therefore required to have higher resolving power 
90 and high degree of alignment. But, it is one of 
industrial machines so that higher productivity is 
also desired. 

Figures 1A, IB and 1 C illustrate three basic 
flowcharts of the prior art alignment and exposure 
95 systems (aligners). Figure 1 A shows the flowchart of 
an aligner utilized in the contact method, the proxim- 
ity method, the one-to-one projection method or the 
like. In such a system, a wafer is processed by a 
single alignment and a single exposure. Figure IB 
1 00 shows the flowchart of a stepper used in the off axis 
alignment system in which a wafer is processed 
through a single alignment and a plurality of expo- 
sure steps. A loop including an exposure step is 
repeatedly carried out through the number of sever- 
105 al tens to a hundred and several tens perwafer.To 
reduce time required in the repetitive exposure 
loops a stage which can be operated with high 
accuracy at high speed is provided. Such a stepper 
has an increased resolving power due to the reduc- 
110 tion projection in comparison with the aligner shown 
in Figure 1 A. The stepperalso has an increased 
accuracy of alignment since the wafer can be 
compensated in expansion and shrinkage by chang- 
ing the amount of movement in the step stage. 
1 1 5 However, since the alignment patterns on the mask 
and wafer have to be independently positioned by 
different detection systems, and then, the projected 
pattern of the mask pattern must be moved to be 
ciligned with the wafer pattern while assuring ulti- 
1 20 mate correctness of the movement. This results in 
larger number of factors of error. It is difficult to 
increase the alignment in accuracy, suppressing 
various errors. 

A system which can eliminate the above factors of 
125 error while assuring the increased resolving power 
and alignment accuracy is a stepper utilized in the 
TTL on-axis alignment system (die-by-die alignment 
system) in which the alignment patterns uii the mask 
and wafer can manually be aligned with each other 
130 through a lens system. Figure 1 C shows the flow- 
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chart of such a system in which a loop 53 includes 
steps of alignment, exposure and stage movement. 
As a result, the system requires an additional time 
which is equal to the operational time of alignment 
5 multiplied by the number of steps in comparison 
with the stepper in the off-axis alignment system. 
For the TTL on-axis system stepper, therefore, it is an 
essential proposition that time is reduced particular- 
ly in the alignment operation to assure the necessary 
10 throughput in a production machine. 

For example, when fifty exposures per wafer is 
required and if a throughput of 50 wafers/hour is 
expected, the total time afford to alignment opera- 
tion plus exposure plus stage movement should be 
1 5 equal to about one or two seconds. Reduction of 
time by 0,1 seconds corresponds to the saving of five 
seconds per wafer. The throughput can be increased 
by three of four wafers per hour. 

It is of course that the reduction of time in 
20 alignment is a common theme for all the prior art 
aligners although the degree of importance is diffe- 
rent from one another. Figure 2 shows the slightly 
detailed portion of the flowchart shown in Figure IB 
in connection with the alignment operation. If this 
25 flowchart portion is considered in respect to time 
required in the alignment operation, there are two 
significant problems one of which is a delay time t in 
Step 62. A rest time after the stage has been moved 
(time through which an oscillation produced in the 
30 system can be deemed to be stopped) is variable 
depending on the amount of movement in the stage 
or the selection of a stage to be moved, for example. 
To obviate this, a constant delay time f is provided 
by adding some safety time to the longest time of 
35 the times produced in the above situations. This 
method, however, deny the possibility of speeding- 
up. 

The other problem is the number of loops 67 
through which detections and movements are repe- 
40 atedly carried out. Necessarily, as the number of 
loops 67 is increased, loss of time is increased. 

A certain fixed relation is among fourfactors, an 
accuracy crA in AA (automatic alignment) detection, 
an accuracy aS in the movement of the stage, a 
45 acceptable-or-not discrimination (tolerance T) mean- 
ing an expected accuracy and the number of move- 
ment R (the number of re-adjustments of the stage 
until an alignment is attained). If three factors of 
them are determined, the remaining factor can 
50 substantially be determined. A single AA signal is 
insufficient to assure the necessary alignment accur- 
acy in the system so that plural of the same is taken 
and the accuracy is improved by averaging all the 
received signal detection values. If detected signal 
55 values of N in number are averaged, the accuracy is 
improved substantial 'y by a factor of \fN . If rrS is 
deemed to be an inherent value in the system and 
vjhen the tolerance T and the number of movements 
R are set as expected values, the number N of 
60 received signals can necessarily be determined. In 
tne prior art, the number N of received signals was 
set as an inherent value in the system. 

Actually, the variation in received signals is vari- 
able depending on different steps in the semi- 
65 conducto-- production process or different lots of 
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production in the same process. Accordingly, the 
variation in an averaged value of received signals of 
N in number also is variable. As a result, the number 
of movements is always varied contrary to expecta- 

70 tion so that the throughput will be influenced. On the 
other hand, the accuracy of alignment can be 
determined based on the signal detection accuracy 
aA, the accuracy of movement aS and the tolerance 
T. If the detection accuracy o-A is varied, the final 

75 accuracy of alignment also is varied. 

As will be understood from the foregoing, the 
alignment operation in the prior art is disadvan- 
tageous in that it is impossible to reduce time 
required in the alignment and yet that the expected 

80 throughout and alignment accuracy are unstably 
and always varied. 

Summary of the Invent/on 
It is an object of the present invention to eliminate 
85 the disadvantages of the prior art, to reduce time 
required in alignment operation and to assure the 
accuracy of, and the time required for, the alignment 
which are stable relative to the expected values. 
Another object of the present invention is to 
90 provide means for assuring the accuracy of, and the 
time required for, the alignment which are always 
stable independently of a difference between steps 
in the integrated circuit production process of a 
variation in alignment detection accuracy due to a 
95 difference between lots. 

Still another object of the present invention is to 
provide means for improving the throughput by 
initiating the reception of AA signals with the 
necessary minimum stand-by time for the stillness 
100 after the stage stop. 

These and other objects, features and advantages 
of the present invention will become more apparent 
upon a consideration of the following description of 
the preferred embodiment of the present invention 
105 taken in conjunction with the accompanying draw- 
ings. 

Brief description oftiie drawings 

Figure 1A is a basic flowchart of an aligner used in 
110 the contact method, proximity method and one-to- 
one projection method; 

Figure IB is a basic flowchart of an off-axis tipe 
stepper; 

Figure 7C is a basic flowchart of a TTL die-by-die 
115 tipe stepper; 

Figure 2 \s> a flowchart of the prior art alignment 
operating section; 

Figure 3 is a perspective view of one embodiment 
according to the present invention; and 
120 Figures 4A and 4B is flowcharts of the alignment 
operating section according to the present invention. 

Description of tfie preferred embodiments 

Figure 3 shows the basic construction in the TTL 

125 on-axis system together with a wafer 70 having 
alignment patterns 70a and 70b formed thereon. 
Although a set of alignment patterns are actually 
provided for each shot area, only one set of align- 
ment patterns is illustrated in Figure 3 for the 

130 simplicity of explanation. The system includes a 
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wafer stage 71 which holds the wafer 70 thereon and 
is nnoved in the directions X and Y by means of a 
drive mechanism Ml. The system also includes a 
mask stage 74 which holds a photo-mask (reticle) 73 
5 thereon and is moved in the directions X, Y and 6 by 
means of a drive mechanism M2 in two modes of 
coarse and fine movements. The photo-mask 73 
includes alignment patterns 73a, 73b and an actual 
element pattern 73c which are formed thereon. The 
10 mask stage 74 includes an unshown opening formed 
therethrough such that the projection of the align- 
ment and actual element patterns will not be dis- 
turbed. The system further includes a reduction 
projection optica! system 72. The step motion of the 
15 wafer is carried out by moving the wafer stage 71 
while the alignment between the photo-mask and 
wafer (s attained by moving the mask stage 74 or 
wafer stage 71. 

The system further includes objective lenses 75a, 
20 7bb; semi-transmission mirrors 76a, 76b; an optical 
pcJlh bpliltiny prism 77; photo-cells 78a, 78b; a 
polygonal mirror 79 and a source of laser beam all of 
which define a signal detection system. Laser beam 
from tho source 80 is scanningly deflected by 
25 rotating the polygonal mirror 79 at a constant speed. 
The laser beam is then incident on the prism 77 and 
exits leftwardly therefrom in the former half of a 
smgle scanning motion and rightwardly therefrom in 
the latter halt of the same scanning motion. Each of 
30 the laser beam portions from the prism 77 is 

lefiec.eU by the corresponding semi-transmission 
mrro' 76a or 76b to the respective objective lens 75a 
or 75h After passing through the corresponding 
oti pmvp lens 75a or75b, the laser beam portion 
35 scdPs the photo-mask 73. The laser beam portion 
tlien p Jsses through the projection lens 72 to scan 
tiK- wjfe' 70. After reflected by the alignment 
p.nnorns 70a and 70b on the wafer 70, the laser beam 
po tijdb are incident back on the projection lens 72 
4L' .:'ui w '-n incident on the photo-cells 78a and 78b 
I' K t :ti .litj objective lenses 75a, 75b and semi- 
n.! . T— ,3[on mirrors 76a, 76b together with the laser 
1 .■ ; ornons reflected by the alignment patterns 
^ 'I u ■ r>i,-ic;k When the photo-cells detect the laser 
J- riions, they generate detection signals 

,M ' I if - ir turn inputted to a control unit C. The 
V ■ jn signals are processed in accordance with 
* . A Kias decribed hereinafter with the results 
; ; ;isecl to move the stage 74 through the drive 
f ' u^ism M2 to attain an alignment between the 
vN ■ 70 and the photo-mask 73. 

At,; we the photo-mask 73 there is provided an 
iHutT, nation system [not shown) which is adapted to 
iiiuriinaiR the actual element pattern C on the 
55 Dhoi' mask when the alignment is completed and 
aher :he detection system is retracted out of the 
optK jl path in the illumination system. 

Fi^j jres 4A and 4B show flow-charts of the align- 
mer-* operating section in the embodiment ot the 
60 present invention. Each of these flowcharts corres- 
pon(Js to the respective alignment steps 4, 13 and 44 
shown in Figures 1A, IB and 1C. 

In The TTL on-axis type steppe^ the stage on wh ch 
the wafer is placed is moved to and stopped at a 
65 pos lion near the exposure position (Step 81). At the 
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same time, the detection system is actuated to 
initiate the reception of AA signals (for example, 
intervals between a plurality of bars of the alignment 
marks) until a predetermined number M of signals 
70 are received (Step 82). Subsequently, a computation 
section computes a deviation (variance) aM in the 
received detection signals of M in number (step 83). 
Actually, the reception of signals and the computa- 
tion are simultaneously carried out to successively 
75 compute a deviation o-l in the detected signals of / in 
number which have been received till that time. 

In the first reception of signals of M in number, 
there is necessarily a relative oscillation (vibration) 
between the mask and wafer since the reception is 
80 initiated immediately after the stage has been stop- 
ped. The deviation crM is therefore larger than that 
after the oscillation has been stopped. If a critical 
value aL suitable for the system has been set and is 
compared with the deviation aM (Step 84) and when 
85 the critical value is smaller than the deviation aM, 
the previous data is cleared and the next receiving 
operation for signals of M in number is initiated 
(Step 91). ■ ' 

The critical value can be determined based on 
90 experiments in which the process of attenuation in 
any oscillation produced in the whole system upon 
movement of the particular stage is examined and in 
which the influence to accuracy in detection due to 
any degree of oscillation is determined. 
95 In an other embodiment of the present invention 
shown in Figure 4B, if the acceptable-or-not discrimi- 
nation is "not", the next reception of signals is 
effected and at the same time the first data (oldest 
data) is discarded (Step 105). Consequently, the 
100 computation of deviation aM will be carried out 
based on the latest data of signals of M in number 
(Step 103). 

If the deviation o-M is compared with the critical 
value aU the date of oscillation after the movement 

105 of the stage can subsequently instantaneously be 

discriminated. If the deviation aM becomes equal to 
or smallerthan the critical value aU the procedure 
goes out of this routine 91 or 1 1 2. Thereafter, the 
procedure may use the data of M in number which 

110 have been used in the discrimination of aM ^ aLor 
may sample new data and use the same. The 
embodiment shown in Figure 4 is in the case of the 
former. 

In the next step, the next signal is received to 
115 compute aM + i <i - 1 , 2, 3 ...) in such a manner that it 
is accumulated on the previous data (Steps 86 and 
107). The total number of samples, N-M + i, used to 
compute the valun rrM + i is^tilized to compute a 
function of deviation ctN/V ^' v^hich is in turn 

120 compared with a preset critical value aT (Steps 87 
and 108). In Step 87 or 108, the next signal is 
succeedingly received along a loop 92 or 1 14 until 
the value (tN/\/ M is equal to or smaller than aT. 
The function of deviaton nN VN is exemplified fur 

1 25 such a purpose ttiat if a variation (variance) in 

detected alignment signals is varied to increase 'or 
any reason, the number of receiving operations is 
increased. If the variance is relatively small, the 
number of receiving operation is decreased. Thus, 

130 the variance in ttie average of the received data can 
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be make constant. 

Therefore, the formula o-N/VN is not necessarily 
used and the value o-N is not limited to the deviation 
if it can represent any variation in detected signals. 
5 After the procedure has passed through Step 87 or 
108 at the discrimination of ctN/VN" = crT, the 
average value of data will always have a constant 
variance. 

Although the flowcharts have been described as if 
10 the detection signals are from a single position, it is 
desirable to use two data from different positions 
(XU YL) and [XR , YR). B ased on avera ges of the 
respective data (XL, YL) and (XR, YR), the amounts of 
movement relative to target v alue s, ^X = (XL+XR)/ 
15 2, AY = {Yr+YR)/2, Ae = (YL- YR)/2 are computed 
(Steps 88 and 109). 

It is discriminated whether or not these computed 
values are acceptable, based on preset tolerance 
values (Steps 89 and 1 1 0). if they are not acceptable, 
20 the movements are executed by the given amounts 
AX, AY and Ae (Steps 90 and 111). Thereafter, the 
program again enters the loop in which signals are 
received (Steps 93 and 115). If acceptable, the 
preparation for an exposure step starts. 
25 According to the improved alignment flowcharts 
as shown in Figures 4A and 48, a reliable timing for 
starting reception of signals can be judged at 
real-time by monitoring an oscillation resulting from 
the movement of the stage to eliminate the loss of 
30 time due to the indiscriminate delay time so that the 
throughput can be improved. This advantage can be 
accomplished only by the inherent function in the 
system without any additional hardware (for exam- 
ple, new mechanism) in accordance with the present 
35 invention. 

The present invention provides another advantage 
in that stable alignment accuracy and throughput 
can always be obtained in no connected with any 
difference between steps of wafer, any difference 
40 between lots or others. 

While the invention has been described with 
reference to the structures disclosed herein, it is not 
confined to the details set forth and this application 
is intended to cover such modifications or changes 
45 as may come within the purposes of the improve- 
ments or the scope of the following claims. 

CLAIMS 

50 1. A method comprising the steps of: 

providing the relative movement between a mask 
and wafer; 

repetitively detecting a displacement between said 
mask and wafer; 

55 processing to compute a measurement represent- 
ing a variation in the detected displacements; and 
comparing said measurement with a first critical 
value relating to the degree of stillness in a system, 
and proceeding to a step for computing the amount 

60 of movement for moving said mask and wafer to 
align one with the other if said measurement is 
smaller than said critical value, and proceeding to 
instructing a re-computation for said measurement if 
said measurement is larger than said critical value. 

65 2. A method as defined in claim 1 wherein said 



re-computation for said measurement is executed by 
using a plurality of newly detected displacements 
between said mask and wafer. 

3. A method as defined in claim 1 wherein said 
70 re-computation for said measurement is executed 

using at least one newly detected displacement 
between said mask and wafer added to the previous- 
ly detected displacements and removing the pre- 
viously detected displacements equal in number to 
75 the newly added displacements. 

4. A method as defined in claim 1 wherein said 
step for computing the amount of movement for the 
alignment movement comprises the steps of detect- 
ing at least one new displacement between said 

80 mask and wafer; adding said newly detected dis- 
placement to the previously detected displacements 
with these all displacements being used to compute 
a predetermined function of deviation; and compar- 
ing the computed function of deviation with a 
85 second critical value. 

5. An apparatus comprising: 
as mask stage for moving a mask; 
a wafer stage for moving a wafer; 
step drive means for stepping one of said mask 
90 and wafer stages; 

alignment drive means for moving one of said 
mask and wafer stages to align said mask and wafer 
with each other, 

means for detecting a displacement between said 
95 mask and wafer; 

means for computing an operational value for 
operating said alignment drive means on the basis 
of said displacement; and 

means for determining the degree of stillness of 
100 the apparatus to determine the detected displace- 
ment to be imputted to said computing means. 

6. An apparatus as defined in claim 5, further 
including a projection optical system for projecting 
the image of said mask onto said wafer in a reduced 

105 scale and wherein said detecting means is adapted 
to sense said waferthrough said projection optical 
system. 

7. An apparatus as defined in claim 5 wherein 
said discriminating means includes means for com- 

1 1 0 puting a variance based on a plurality of displace- 
ments; means for comparing the variance with a first 
critical value relating to such attenuation of oscilla- 
tion in the apparatus that error will not substantially 
occurs in the computed results; and means for 
115 determining a positional error used to compute the 
ooerational value if the variance is snialier than said 
first critical value. 

8. An apparatus as defined in claim 5 wherein 
said discriminating means include?^ means for com- 
120 puting a function of deviation based on a plurality of 
displacements; means for comparing the function of 
deviation with a second critical value relating to an 
acceptable variation in signals detected by said 
detection means; and means for determining a 
1 25 displacement to be used for computing an opera- 
tional value if the function of deviation is smaller 
than said second critical value. 

9. An apparatus for achieving a desired substan- 
tially stationary positional relationship between first 
130 and second objects, comprising 
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means for producing relative movement of said 
objects such that the objects assume said positional 
relationship, 

means for detecting the relative position of the 
5 objects when moved to assume said positional 
relationship, 

means responsive to said detecting means for 
determining when said objects achieve a desired 
degree of stillness, and 
10 means for processing at least one of said objects 
when said determined desired degree of stillness is 
achieved. 

10. An apparatus according to claim 9 adapted to 
produce relative movement of a semiconductor 

15 wafer and a mask, wherein said processing means 
includes means for exposing the wafer to radiation 
through the mask. 

11. An apparatus according to claim 10 adapted 
to produce a single exposure through the mask for 

20 the wafer. 

12. An apparatus according to claim 10 having 
an off-axis alignment system. 

13. An apparatus according to claim 10 having a 
TTL alignment system. 

25 14. Apparatus for achieving a plurality of diffe- 
rent positional relationships of a mask and wafer, 
substantially as hereinbefore described with refer- 
ence to Figure 3 and 4A or B o'f the accompanying 
drawings. 
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